Senior Site Reliability Engineer

Engineering Bangalore, Koramangala

Job ID: 25-251

Come Join Our Passionate Team! At Barracuda, we make the world a safer place. We believe every business deserves access to cloud-enabled, enterprise-grade security solutions that are easy to buy, deploy, and use. We protect email, networks, data and applications with innovative solutions that grow and adapt with our customers’ journey. More than 200,000 organizations worldwide trust Barracuda to protect them — in ways they may not even know they are at risk — so they can focus on taking their business to the next level.

Envision Yourself at Barracuda:

The Barracuda Central Intelligence team is looking for a highly skilled and passionate Senior Site Reliability Engineer to join our cross-functional Agile team. The team is responsible for collecting, aggregating, enriching, and distributing threat intelligence across the company to develop threat detection systems that protect our customers. As a Senior Site Reliability Engineer, you will manage production services and work with Engineering and Operations teams to ensure the reliability, scalability, and performance of those services. If you've got what it takes to excel in this role, we would like to talk to you!

What you’ll be working on:

Write clean, high-performance, and well-tested infrastructure code with a focus on reusability (Puppet/Ansible/Terraform/CloudFormation)
Recommend and implement infrastructure best practices in alignment with standard SRE principles and supply guidance on system performance and throughput expectations.
Troubleshoot issues across the entire stack: hardware, software, application, and network 
Establish, maintain, and adhere to Barracuda technical standards, policies, and procedures
Build and enhance our observability and reliability systems
Participate in an on-call rotation
Collaborate with internal groups to design, develop, and deploy manageable, scalable, and robust services 
Perform root cause analysis (RCA) and partner with engineering and operations teams across the organization to roll out fixes
Provide technical guidance and mentorship to other engineers on reliability and scalability best practices, tools, and methodologies

What you bring to the role:

Experience with developing, building, securing, and operating sophisticated and highly automated cloud infrastructure in AWS is a must
Prior success in automating and maintaining an efficient, large-scale, real-world production environment
Extensive experience with orchestrating cloud infrastructure automation using tools like Terraform and CloudFormation
Development experience with continuous integration and delivery (CI/CD) and automation tools such as GitHub, GitHub Actions, Jenkins, Packer, Ansible, Puppet, etc.
Working knowledge of deployment patterns and strategies, including blue/green, canary, rolling deployment, draining, etc.
Comprehensive experience with containers (Docker) and container orchestration in a cloud environment (AWS EKS)
The ability to design, author, and release code in languages like Python
Advanced Operating System skills with knowledge of Linux internals
Extensive experience working with observability and reliability tools like New Relic, Elastic APM (Application Performance Monitoring), CloudWatch, Prometheus and Grafana
Experience with data pipeline engineering and tools like Databricks, Apache Spark, Kafka, and DataStage
Strong debugging skills with a systematic problem-solving approach to identify complex problems
Ability to communicate effectively both verbally and in writing
Self-awareness and a true teamwork spirit
Bachelor's degree in a technology field or equivalent work experience
Minimum of 5 years of experience in a Site Reliability Engineer (SRE) or similar role

What you’ll get from us:

A team where you can voice your opinion, make an impact, and where you and your experience are valued. Internal mobility – there are opportunities for cross training and the ability to attain your next career step within Barracuda. In addition, you will receive equity, in the form of non-qualifying options.

#LI: Hybrid
